TREC 2002 Web Track "Automated Word Sense Disambiguation for Internet Information Retrieval"

نویسندگان

  • Christopher Stokoe
  • John Tait
چکیده

We describe an attempt to use automated word sense disambiguation to improve the performance of an internet information retrieval system. A performance comparison of term frequency verses word sense frequency was carried out, the results of which indicated no significant performance gains from using a sense based retrieval model instead of the traditional TF*IDF.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

UIC at TREC 2005: Robust Track

This paper presents a new approach to improve retrieval effectiveness by using concepts, examples, and word sense disambiguation. We also employ pseudo-feedback and web-assisted feedback.

متن کامل

TREC-9 Cross Language, Web and Question-Answering Track Experiments using PIRCS

In TREC-9, we participated in the English-Chinese Cross Language, 10GB Web data ad-hoc retrieval as well as the Question-Answering tracks, all using automatic procedures. All these tracks were new for us. For Cross Language track, we made use of two techniques of query translation: MT software and bilingual wordlist lookup with disambiguation. The retrieval lists from them were then combined as...

متن کامل

Cross Language Information Retrieval : a Research

Cross-Language Information Retrieval (CLIR) has been a research sub-field for more than a decade now. The field has sparked three major evaluation efforts: the TREC Cross Language Track which currently focuses on the Arabic language, the Cross-Language Evaluation Forum (CLEF) – a spinoff from TREC covering many European languages, and the NTCIR Asian Language Evaluation (covering Chinese, Japan...

متن کامل

Experiment Report of TREC 2005 Genomics Track ad hoc Retrieval Task

This report describes the experiments we have conducted on the ad hoc retrieval task of Genomics track at TREC 2005. In the experiment, a number of different techniques were employed, including Porter stemming, MeSH term and gene name identification, Okapi, weighting schemes, query expansion, and concept-based ranking strategy. The results on sample topics are reported. Future improvements, suc...

متن کامل

Word Disambiguation in Web Search

Internet is huge like a sea as the amount of information is growing rapidly on WEB. Whenever user searches something on Internet the Search Engine provides an incredible amount of information that increases the complexity of dealing with information. Various algorithms have been developed that help the user to retrieve the web contents. Sometimes these algorithms do not give fruitful results es...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002